A New Perspective on Semantics of Data Provenance
نویسندگان
چکیده
Data Provenance refers to the “origin”, “lineage”, and “source” of data. In this work, we examine provenance from a semantics perspective and present the W7 model, an ontological model of data provenance. In the W7 model, provenance is conceptualized as a combination of seven interconnected elements including “what”, “when”, “where”, “how”, “who”, “which” and “why”. Each of these components may be used to track events that affect data during its lifetime. The W7 model is general and extensible enough to capture provenance semantics for data in different domains. Using the example of the Wikipedia, we illustrate how the W7 model can capture domain or application specific provenance.
منابع مشابه
Reexamining Some Holy Grails of Data Provenance
We reconsider some of the explicit and implicit properties that underlie well-established definitions of data provenance semantics. Previous work on comparing provenance semantics has mostly focused on expressive power (does the provenance generated by a certain semantics subsume the provenance generated by other semantics) and on understanding whether a semantics is insensitive to query rewrit...
متن کاملمروری بر مطالعات اُبسیدین در ایران، منشأیابی معادن و اُبسیدین های محوطه های باستانی، پژوهش ها و پرسش های موجود
Obsidian artifacts is frequently used materials in prehistory and found widely in archaeological sites. Provenance studies of obsidian has been an issue of intense research and debate between archaeologists and geologists. Since different provenance studies has been carried out from 1960s up to 2015 in Anatolia and Caucasus but obsidian studies in Iran is in very early stage and consider as ter...
متن کاملA Provenance Tracking Model for Data Updates
For data-centric systems, provenance tracking is particularly important when the system is open and decentralised, such as the Web of Linked Data. In this paper, a concise but expressive calculus which models data updates is presented. The calculus is used to provide an operational semantics for a system where data and updates interact concurrently. The operational semantics of the calculus als...
متن کاملOn the Use of Semantic Annotations for Supporting Provenance in Grids
There has seen a strong demand for provenance in grid applications, which enables users to trace how a particular result has been arrived at by identifying the resources, configurations and execution settings. In this paper we analyses the requirements of provenance support and discusses the nature and characteristics of provenance data on the Grid. We define a new conception called augmented p...
متن کاملTracing where and who provenance in Linked Data: A calculus
Linked Data provides some sensible guidelines for publishing and consuming data on the Web. Data published on the Web has no inherent truth, yet its quality can often be assessed based on its provenance. This work introduces a new approach to provenance for Linked Data. The simplest notion of provenance – viz., a named graph indicating where the data is now – is extended with a richer provenanc...
متن کامل